optimize fp8 inference with AngelSlim by GGgary666 · Pull Request #132 · Tencent-Hunyuan/HunyuanImage-2.1

GGgary666 · 2025-10-28T14:59:22Z

optimize fp8 inference with (AngelSlim)[https://github.com/Tencent/AngelSlim]

yghstill · 2025-10-28T15:27:34Z

 model_name = "hunyuanimage-v2.1"
-pipe = HunyuanImagePipeline.from_pretrained(model_name=model_name, use_fp8=True)
+# Supported fp8_mode: weight_only, w8a8
+pipe = HunyuanImagePipeline.from_pretrained(model_name=model_name, fp8_mode="w8a8")


这里增加一行weight_only的调用代码，可以注释掉

optimize fp8 inference with AngelSlim

0a0b241

yghstill reviewed Oct 28, 2025

View reviewed changes

update readme with a fp8 example

01fc561

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

optimize fp8 inference with AngelSlim#132

optimize fp8 inference with AngelSlim#132
GGgary666 wants to merge 2 commits intoTencent-Hunyuan:mainfrom
GGgary666:angelslim_fp8_support_1028

GGgary666 commented Oct 28, 2025

Uh oh!

yghstill Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

GGgary666 commented Oct 28, 2025

Uh oh!

yghstill Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants